Introduction to Text Visualization by Nan Cao & Weiwei Cui

Author: Nan Cao & Weiwei Cui
Language: eng
Format: epub
Publisher: Atlantis Press, Paris


5. Visualizing Document Content

Nan Cao (1) and Weiwei Cui (2)

(1)IBM T. J. Watson Research Center, Yorktown Heights, New York, USA

(2)Microsoft Research Asia, Beijing, China

Nan Cao

Email: [email protected]

Abstract

Text consists primarily of words and is meant to convey content. Content analysis is the earliest established method of text analysis (Holsti et al., The handbook of social psychology, vol 2, pp 596–692, 1968 [55]). Linguists have studied text content extensively and systematically, and the related disciplines can be roughly divided into two categories, structure and substance, according to their subjects of study (Ansari, Dimensions in discourse: elementary to essentials. Xlibris Corporation, Bloomington, 2013 [9]). Structure concerns the surface characteristics that are visible in a valid text, such as word co-occurrence, text reuse, and grammatical structure. Substance, on the other hand, is the umbrella term for all information that must be inferred from text, such as fingerprinting, topics, and events. Various techniques have been proposed to analyze these aspects. In this chapter, we briefly review these techniques and the corresponding visualization systems.


